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ABSTRACT 



Methods and systems for processing pixels within an encoded 
video stream are disclosed herein. Aspects of the method may 
comprise acquiring on a chip, a prediction weight type from 
the encoded video stream. Prediction weight data for at least 
one prediction pixel within the encoded video stream may be 
acquired based on the prediction weight type. A look-up table 
entry in a look-up table may be generated for the prediction 
pixel utilizing the prediction weight data. A plurality of cur- 
rent pixels within the encoded video stream may be estimated 
utilizing the look-up table entry. The prediction weight type 
may correspond to a plurality of reference pictures within the 
encoded video stream. A prediction weight value, offset value 
and motion vector data for the prediction pixel may be 
acquired on the chip, if the prediction weight type comprises 
an explicit prediction weight type. 

39 Claims, 6 Drawing Sheets 
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VIDEO DECODING FOR MOTION 
COMPENSATION WITH WEIGHTED 
PREDICTION 
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DEVELOPMENT 
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MICROFICHE/COPYRIGHT REFERENCE 

[Not Applicable] 

BACKGROUND OF THE INVENTION 

During video encoding, an exemplary video stream may be 
encoded on a frame-by-frame basis, and each frame may be 
encoded on a macroblock-by-macroblock basis. The 
MPEG-4 video compression standard may be utilized to 
encode video information on a macroblock-by-macroblock 
basis. For example, the MPEG-4 standard specifies the use of 
spatial prediction, temporal prediction, discrete cosine trans- 
formation (DCT), interlaced coding, and lossless entropy 
coding to compress macroblocks within a video stream. Spa- 
tial prediction techniques may be utilized during decoding to 
estimate pixels based on neighboring pixels. For example, 
pixels in a current macroblock may be estimated utilizing 



spatial prediction from pixels within a neighboring macrob- 
lock to the top and/or to the left of the current macroblock. 
Temporal prediction techniques may also be utilized during 
decoding of an elementary video stream to estimate pixels in 

5 a current macroblock based on pixels in different previous or 
subsequent frames and/or reference pictures. 

During decoding of temporally predicted video informa- 
tion, explicit and/or implicit prediction data may be utilized to 
decode temporally predicted pixels within an elementary 

10 video stream. A conventional decoder utilizes separate hard- 
ware blocks, including separate memories, to implement tem- 
poral prediction of video information based on explicit and 
implicit prediction data. In addition, explicit prediction data, 
which may be encoded directly in the elementary video 

15 stream, may be acquired from the video stream and commu- 
nicated to different processing blocks within the decoder for 
further processing. The utilization of separate hardware pro- 
cessing blocks within a conventional decoder to acquire or 
generate prediction data and process the acquired or gener- 

20 ated prediction data separately to predict each encoded pixel, 
results in a significant increase in the processing time neces- 
sary for decoding an elementary video stream and, therefore, 
a decrease in the efficiency of the video decoder. 

Further limitations and disadvantages of conventional and 

25 traditional approaches will become apparent to one of ordi- 
nary skill in the art through comparison of such systems with 
the present invention as set forth in the remainder of the 
present application with reference to the drawings. 



30 



BRIEF SUMMARY OF THE INVENTION 



Certain embodiments of the invention may be found in a 
method and system for processing pixels within an encoded 

35 video stream. Aspects of the method may comprise acquiring 
on a chip, a prediction weight type from the encoded video 
stream. Prediction weight data for at least one prediction pixel 
within the encoded video stream may be acquired based on 
the prediction weight type. A look-up table entry in a look-up 

4Q table may be generated for the prediction pixel utilizing the 
prediction weight data. A plurality of current pixels within the 
encoded video stream may be estimated utilizing the look-up 
table entry. The prediction weight type may correspond to a 
plurality of reference pictures within the encoded video 
stream. 

45 

A prediction weight value, offset value and motion vector 
data for the prediction pixel may be acquired on the chip, if 
the prediction weight type comprises an explicit prediction 
weight type. The look-up table entry may be generated on the 

50 chip utilizing the prediction weight value and offset value for 
the prediction pixel. The generated look-up table entry may 
correspond to a luminance and/or chrominance components 
of the prediction pixel. The look-up table entry may be 
indexed utilizing the motion vector data. The current pixels 

55 may be estimated utilizing the indexed look-up table entry. 
The acquired motion vector data, prediction weight value and 
offset value may be stripped off from the encoded video 
stream. 

If the prediction weight type comprises an implicit predic- 
60 tion weight type, motion vector data for the prediction pixel 
may be acquired. A prediction weight value for the prediction 
pixel may then be generated, where the prediction weight 
value may be based on a temporal distance between pixels 
within the encoded video stream. The look-up table entry may 
65 be generated on the chip utilizing the prediction weight value . 
The current pixels may be estimated utilizing the generated 
look-up table entry. 
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Another aspect of the method may comprise receiving the 
encoded video stream and acquiring prediction data from the 
received encoded video stream. A table may be generated 
during the acquiring of the prediction data based on the 
acquired prediction data for estimating a current pixel based 5 
on a previous pixel and a subsequent pixel. The generated 
table may be updated for a plurality of frames. If the acquired 
data comprises explicit prediction data, prediction weight 
information, offset and motion vector information may be 
acquired from the encoded video stream. If the acquired data 10 
comprises implicit prediction data, motion vector informa- 
tion may be acquired from the encoded video stream. Predic- 
tion weight information corresponding to the acquired 
motion vector information may be generated based on tem- 
poral distances between pixels in a current frame and a ref- 15 
erence frame. 

Another embodiment of the invention may provide a 
machine-readable storage, having stored thereon, a computer 
program having at least one code section executable by a 
machine, thereby causing the machine to perform the steps as 20 
described above for processing pixels within an encoded 
video stream. 

Aspects of the system may comprise at least one processor 
that acquires on a chip, prediction weight type from encoded 
video stream. Prediction weight data for a prediction pixel 25 
within the encoded video stream may be acquired by the 
processor based on the prediction weight type. A look-up 
table entry in a look-up table may be generated by the pro- 
cessor for the prediction pixel utilizing the prediction weight 
data. A current pixel within the encoded video stream may be 30 
estimated by the processor utilizing the look-up table entry. 
The prediction weight type may correspond to a plurality of 
reference pictures within the encoded video stream. 

If the prediction weight type comprises an explicit predic- 
tion weight type, the processor may acquire a prediction 35 
weight value, an offset value and motion vector data for the 
prediction pixel. The look-up table entry may be generated by 
the processor utilizing the prediction weight value and offset 
value for the prediction pixel. The generated look-up table 
entry may correspond to luminance and/ or chrominance com- 40 
ponents of the prediction pixel. The processor may index the 
look-up table entry utilizing the motion vector data and may 
estimate the current pixels utilizing the indexed look-up table 
entry. The motion vector data, prediction weight value and 
offset value may be stripped off by the processor from the 45 
encoded video stream. 

If the prediction weight type comprises an implicit predic- 
tion weight type, motion vector data for the prediction pixel 
may be acquired by the processor. A prediction weight value 
for the prediction pixel may be generated by the processor and 50 
the prediction weight value may be based on a temporal 
distance between plurality of pixels within the encoded video 
stream. The processor may generate the look-up table entry 
utilizing the prediction weight value and may estimate the 
current pixels utilizing the generated look-up table entry. 55 

These and other features and advantages of the present 
invention may be appreciated from a review of the following 
detailed description of the present invention, along with the 
accompanying figures in which like reference numerals refer 
to like parts throughout. 60 

BRIEF DESCRIPTION OF SEVERAL VIEWS OF 
THE DRAWINGS 

FIG. 1 is a block diagram of an exemplary weighted video 65 
processing scenario, in accordance with an embodiment of 
the invention. 
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FIG. 2 is a diagram illustrating an exemplary explicit pre- 
diction weighting table, in accordance with an embodiment of 
the invention. 

FIG. 3 is a diagram illustrating an exemplary implicit pre- 
diction weighting table, in accordance with an embodiment of 
the invention. 

FIG. 4 is a flow diagram illustrating an exemplary method 
for implementing an explicit prediction weighting table, in 
accordance with an embodiment of the invention. 

FIG. 5 is a flow diagram illustrating an exemplary method 
for implementing an implicit prediction weighting table, in 
accordance with an embodiment of the invention. 

FIG. 6 is a block diagram illustrating an exemplary system 
for video decoding utilizing motion compensation with 
weighted prediction, in accordance with an embodiment of 
the invention. 

DETAILED DESCRIPTION OF THE INVENTION 

Certain aspects of the invention may be found in a method 
and system for processing pixels within an encoded video 
stream. During encoding of a video stream, prediction infor- 
mation may be encoded within the elementary video stream. 
For example, for explicitly encoded prediction information, 
motion vector information associated with one or more ref- 
erence pictures may be encoded within the elementary video 
stream, together with prediction weighting information and/ 
or offset information. For implicitly encoded video informa- 
tion, motion vector information may be encoded within the 
elementary video stream and additional prediction informa- 
tion may be generated utilizing the motion vector informa- 
tion. In one aspect of the invention, a single memory may be 
utilized to handle both explicit and implicit prediction infor- 
mation during decoding. 

For example, an implicit prediction weighting look-up 
table may be generated utilizing motion vector information 
obtained from the video stream. The implicit prediction 
weighting look-up table may comprise prediction weights for 
a plurality of reference pictures. Similarly, explicitly encoded 
prediction information may be extracted from the elementary 
video stream and a look-up table may be generated with 
corresponding prediction weight and offset information for a 
plurality of reference pictures used for decoding pixels within 
a current frame. After the look-up table is generated, the 
explicit prediction information may be removed from the 
elementary video stream to improve data flow and overall 
processing speed within the video decoder. 

In MPEG-4 video decoding, a partition may be predicted 
from blocks of pixels P in more than one field/frame. For 
bi-directionally coded bitstream, a partition may be predicted 
from two weighted blocks of pixels, P 0 and P l3 for example. 
Accordingly, a prediction error E may be calculated as the 
difference between the weighted average of the prediction 
blocks W 0 P 0 +W 1 P 1 and the partition. The prediction error E 
and an identification of the prediction blocks P 0 , P : may be 
encoded within the elementary video stream. The prediction 
blocks P 0 and P : may be identified by motion vectors MV. 
Prediction weights, or weighting factors, W 0 and W : may, for 
example, be encoded explicitly in the elementary video 
stream, or implied from an identification of the field(s)/frame 
(s) containing the prediction blocks P 0 and P : from a corre- 
sponding motion vector. The prediction factors W 0 and W : 
may, for example, be implied from a temporal distance 
between the frames/fields containing the prediction blocks P 0 
and P : and the frame/field containing the partition. In an 
exemplary scenario, where T 0 is the number of frame/field 
durations between the frame/field containing P 0 and the 
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frame/field containing the partition, and T 1 is the number of 
frame/field durations for P 1? the weighting factors may be 
determined as follows: W 0 =1-T 0 /(T 0 +T 1 ) and W 1 =l-T 1 / 
(T 0+ri ). More detailed information on bi-directional coding 
may be found in U.S. patent application Ser. No. 1 0/963,677, 5 
filed Oct. 13, 2004, the subject matter of which is hereby 
incorporated by reference in its entirety. 

FIG. 1 is a block diagram 100 of an exemplary weighted 
video processing scenario, in accordance with an embodi- 
ment of the invention. Referring to FIG. 1, there is illustrated 10 
a weighted prediction scenario of pixels within a current 
macroblock B in a current reference picture 2 utilizing pre- 
diction pixels in macroblocks A and C in a corresponding 
previous reference picture 0 and a subsequent reference pic- 
ture 5, respectively. 15 

Weighting prediction to obtain pixels within the current 
macroblock B may be a function of temporal distance 
between the current reference picture 2 and one or more 
previous or subsequent reference pictures. For example, in 
the exemplary two-reference scenario of FIG. 1, weighting 20 
may be a function of respective temporal distances between 
macroblocks A and C in reference pictures 0 and 5 and the 
current macroblock B in reference picture 2. In one aspect of 
the invention, weighting may be a mnction of a first temporal 
distance D : between the first reference picture 0 and the 25 
current picture 2, and a second temporal distance D 2 between 
the second reference picture 5 and the current picture 2 . In this 
manner, a first weighting factor W\ may be a function of 
D 2 /(D 1 +D 2 ), and a second weighting factor W 2 may be a 
function of D 1 /(D 1 +D 2 ). Even though the present invention 30 
contemplates the use of two references, the invention is not 
limited to a two-reference scenario or characteristics thereof. 
In addition, the invention is not limited to the use of a refer- 
ence frame, or reference picture, as a source of reference 
pixels for temporal prediction. More than two references may 35 
also be utilized, as well as units of pixels other than frames 
may be utilized as a source of reference pixels. 

In a different aspect of the invention, an output pixel within 
the current macroblock B may be estimated utilizing the 
following equation: 40 

outpel B='mpeiA*W A +'mpe\ C*JTc+(±) Offset, 

where inpel A and inpel C may be prediction pixels within 
macroblocks A and C, associated with prediction weighting 45 
factors and W c , respectively. An offset value may also be 
utilized to estimate the output pixel within the current mac- 
roblock B. The offset value may be a non-zero value if explicit 
weighting prediction is utilized, and it may be equal to zero if 
implicit weighting prediction is utilized. If explicit weighting 50 
is utilized to predict pixels within macroblock B, weighting 
factors and W c for pixels within macroblocks A and C 
may be encoded within the elementary video stream, together 
with a corresponding offset value. Motion vector information 
may also be encoded within the elementary video stream to 55 
indicate the corresponding frames that may be used for tem- 
poral prediction. 

If implicit weighting is utilized to predict pixels within 
macroblock B, only motion vector information may be 
encoded within the elementary video stream. Corresponding 60 
weighting factors may be generated based on the distance 
between the current frame and the reference frames associ- 
ated with the motion vector information. 

Implicit and/or explicit weighting prediction factors may 
be determined at various points during processing. In one 65 
exemplary embodiment of the invention, weighting factors 
may be determined at frame/field boundaries. Alternatively, 
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for example, weighting factors may be determined at slice or 
macroblock boundaries. A set of weighting factors may, for 
example, be determined whenever a plurality of potential 
reference pictures/frames for a current picture/frame 
changes. In addition, weighting factors may be determined by 
a processor executing software instructions, by hardware, or 
by a combination thereof. 

After prediction information, such as weighting factors, 
motion vectors, and/or offsets, is obtained from an elemen- 
tary video stream and/or determined based on information 
within the elementary video stream, the prediction informa- 
tion for a plurality of reference pictures may be stored in a 
look-up table. Such weighting information may, for example, 
be received, determined and/or calculated in real-time. Alter- 
natively, for example, such weighting prediction information, 
or various components thereof, may be pre-stored and 
retrieved in real-time. The look-up table may be updated or 
modified at various points during video information process- 
ing and in response to various events. In an exemplary sce- 
nario, where the look-up table is generated once per frame, 
the look-up table may be utilized during the processing of 
every macroblock in the frame before the look-up table is 
updated. 

An entry in an exemplary look-up table may comprise a 
single weighting factor or a plurality of weighting factors, for 
example. In an exemplary scenario comprising only a single 
weighting factor per reference picture, an additional weight- 
ing factor may be determined from the single stored weight- 
ing factor. For example, in a two-reference scenario, the look- 
up table may comprise a first weighting factor W 15 and a 
second corresponding weighting factor may be determined to 
be W 0 =l-W 1 . Alternatively, a prediction weighting table 
may comprise weighting and/or offset information corre- 
sponding to a plurality of possible reference pictures. For 
example, a current macroblock may be defined by a first and 
second motion vector. The first motion vector may comprise 
information based on a first reference picture, and the second 
motion vector may comprise information based on a second 
reference picture. 

FIG. 2 is a diagram illustrating an exemplary explicit pre- 
diction weighting table 200, in accordance with an embodi- 
ment of the invention. Referring to FIG. 2, the explicit pre- 
diction weighting table 200 may comprise prediction 
information for a plurality of reference pictures used for 
temporal prediction of pixels within a current frame. In addi- 
tion, since two references may be used to predict a current 
pixel, the weighting table 200 may comprise two columns of 
prediction information, a "0" column and a "1" column, with 
prediction information for pixels in two reference pictures. 
For example, the explicit prediction weighting table 200 may 
comprise prediction information for 32 pairs of reference 
pictures organized in a "0" column and a "1" column, or 
prediction information for a total of 64 reference pictures. 

In one aspect of the invention, each entry in the explicit 
prediction weighting table 200 may comprise 6 bytes that 
may be utilized, for example, to store weighting factor infor- 
mation [W (0 . z)y ; W (0 . z)t/ ; W (0 . z)K | and offset information [S (0 . 
or, S(- 0 . z)t/ ; S (0 . z)F ] for each of the luminance and chrominance 
components of pixels within a reference picture "i." The 
weighting factor information [W (0 . z - ):r ; W (0 . z)t/ ; W (0;z)K ] and 
offset information [S (0 . z)y ; S (0 .^ S (0 . z)K ] for each of the 
luminance and chrominance components of pixels within a 
reference picture "i"may be obtained from the elementary 
video stream and may be recorded in the explicit prediction 
weighting table prior to processing any pixel information. 
Each of the entries in the explicit prediction weighting table 
200 may be indexed by the reference picture number and/or 
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by the corresponding motion vector information obtained 
from the elementary video stream. After motion vector infor- 
mation is acquired and weighting factor information and off- 
set information are recorded as entries in the explicit predic- 
tion weighting table 200, such prediction information may be 5 
stripped off the elementary video stream. The stripping off of 
the prediction information from the elementary video stream 
may decrease the amount of information being processed in 
the video decoder and, therefore, improve the processing 
speed and overall decoding efficiency of the video decoder. 10 

In operation, prior to processing an elementary video 
stream, the explicit prediction weighting table 200 may be 
initially loaded with default prediction weighting and offset 
values. After an initial sequence of video frames and/or video 
slices, for example, is acquired from the elementary video 15 
stream, the explicit prediction weighting table may be gener- 
ated with weighting factor and offset information acquired 
from the elementary video stream. After a first reference 
picture is determined from corresponding motion vector, 
information of the first reference picture, such as the refer- 20 
ence picture number and/or a temporal distance information 
between the reference picture and the current picture, may be 
utilized as a table index to obtain weighting and/or offset 
information corresponding to the first reference picture from 
a "0" column in the explicit prediction weighting table 200. 25 
Similarly, after a second reference picture is determined from 
corresponding second motion vector, information pertaining 
to the second reference picture may be utilized to obtain 
weighting and/or offset information corresponding to the sec- 
ond reference picture from a "1" column in the explicit pre- 30 
diction weighting table 200. Information pertaining to the 
second reference picture may comprise a reference picture 
number and/or a temporal distance information between the 
reference picture and the current picture. 

FIG. 3 is a diagram illustrating an exemplary implicit pre- 35 
diction weighting table 300, in accordance with an embodi- 
ment of the invention. Referring to FIGS. 1 and 3, the implicit 
prediction weighting table 300 may comprise prediction 
information for a plurality of reference pictures used for 
temporal prediction of pixels within a current frame. For 40 
example, pixels within a current macroblock B in a current 
picture 2 of FIG. 1 may be estimated utilizing prediction 
pixels within a previous macroblock A in reference picture 0 
and pixels within a subsequent macroblock C in reference 
picture 5. Weighting factors for pixels within macroblocks A 45 
and C may be implicitly encoded within the elementary video 
stream. 

In operation, a video decoder may acquire motion vector 
information associated with reference pictures 0 and 5. Tem- 
poral distances, such as distances Dl and D2 in FIG. 1, may 50 
then be generated based on the acquired motion vector infor- 
mation. The motion vector information and the temporal dis- 
tance information may be utilized by the video decoder to 
generate the implicit prediction weighting table 300 of FIG. 
3. Entries within the implicit prediction weighting table 300 55 
may be separated into a "0" column and a "1" column based 
on the direction of a corresponding motion vector between a 
current picture and a reference picture. For example, the first 
row in the implicit prediction weighting table 300 may be 
utilized to store weighting factors for reference picture 0 60 
depending on whether a motion vector designates temporal 
movement direction from reference picture 0 to current pic- 
ture 2, reflected in column "0," or temporal movement direc- 
tion from current picture 2 to reference picture 0, reflected in 
column " 1 In this manner, the implicit prediction weighting 65 
table 300 may be order-dependent so that a table entry, or 
weighting factor, W (0 . 3) corresponding to indices (2:3) does 
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not necessarily contain the same information as weighting 
factor Wq. 3) corresponding to indices (3:2). A first table 
index, therefore, may correspond to a temporally preceding 
picture reference and a second successive table index may 
correspond to a temporally succeeding picture reference. 

In one aspect of the invention, a video decoder may acquire 
motion vector information from an elementary video stream 
and may generate an implicit prediction weighting table for a 
total of n reference pictures, as illustrated in FIG. 3. The 
generated implicit prediction weighting table may be utilized 
to provide weighting factor information for a specific mac- 
roblock, frame and/or slice. Entries within the generated 
implicit prediction weighting table, similarly to entries in the 
explicit prediction weighting table may be updated on the fly 
or after decoding a determined number of frames and/or 
video slices. 

A decoder utilizing motion compensation with weighted 
prediction may utilize prediction information within an 
elementary video stream and generate an explicit or implicit 
prediction weighting table that may be used to decode sub- 
sequent video frames and/or slices. Memory within the 
decoder, such as on-chip RAM, may be efficiently utilized for 
storing either an explicit or implicit prediction weighting 
table, depending on whether prediction information is explic- 
itly or implicitly encoded within the elementary video stream. 

After a prediction weighting table is generated with 
weighting factor and/or offset entries for specific reference 
pictures, pixels within a current macroblock or a current 
frame, for example, may be estimated utilizing pixels within 
previous and/or subsequent macroblocks and/or frames. In an 
exemplary aspect of the invention, the incoming elementary 
video stream may comprise weighting and/or offset informa- 
tion corresponding to a video slice. A video decoder may 
receive such information and may process the received infor- 
mation and place such information in a weighting table. Sub- 
sequent processing may then process blocks and sub-blocks 
within the slice by utilizing table information for the slice. 
The table may, for example, comprise weighting and/or offset 
information corresponding to various reference pictures. Pre- 
diction information within an explicit and/or implicit weight- 
ing information table may be updated at different intervals. 
For example, weighting information tables may be updated 
after a determined number of frames and/or a determined 
number of video slices. The present invention is not limited 
by the number of weighting factors that may be stored within 
an implicit and/or explicit prediction weighting table, nor is 
the present invention limited by the frequency at which the 
implicit or explicit prediction weighting table may be updated 
during decoding. 

FIG. 4 is a flow diagram illustrating an exemplary method 
400 for implementing an explicit prediction weighting table, 
in accordance with an embodiment of the invention. At 401, 
explicit prediction weighting factors, motion vector informa- 
tion and offset information may be acquired from an elemen- 
tary video stream. At 403, a table of explicit prediction 
weighting factors may be generated, where each table entry 
may comprise prediction weighting values and offset values 
for luminance and chrominance pixel components of a refer- 
ence picture. At 405, the acquired explicit weighting factors, 
motion vector information and offset information may be 
removed from the elementary video stream. At 407, the 
explicit prediction weighting factors from the generated table 
may be utilized to estimate a current pixel. 

FIG. 5 is a flow diagram illustrating an exemplary method 
500 for implementing an implicit prediction weighting table, 
in accordance with an embodiment of the invention. At 501, 
motion vector information may be acquired from an elemen- 
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tary video stream. At 503, temporal distance information 
between a current frame and a reference frame, or picture, 
may be determined from the acquired motion vector informa- 
tion. At 505, table index information and prediction weight- 
ing factors may be determined based on the temporal distance 5 
information. At 507, table index information may be utilized 
to retrieve weighting factors from the table. At 509, the 
retrieved weighting factors may be utilized during temporal 
prediction of a current pixel. 

FIG. 6 is a block diagram illustrating an exemplary system 10 

600 for video decoding utilizing motion compensation with 
weighted prediction, in accordance with an embodiment of 
the invention. Referring to FIG. 6, the video signal processing 
system 600 may comprise a video signal source 607, a video 
enabled device 601 and a display 604. 15 

The video signal source 607 may comprise a video encoder 
and may be adapted to generate an elementary video stream 
605. The video signal source 607 may utilize one or more 
video encoding standards, such as MPEG-4, for example, and 
may be implemented as a video head end, for example. The 20 
video signal source 607 may communicate the elementary 
video stream 605 to the video enabled device 601 for further 
processing, including decoding of the elementary video 
stream 605. The video signal source 607 may be connected to 
the video enabled device 601 via a wired and/or a wireless 25 
connection. 

The video enabled device 601 comprises suitable circuitry, 
logic and/or code and may be adapted to process an elemen- 
tary video stream 605. For example, the video enabled device 

601 may comprise a decoder 602 and may be adapted to 30 
decode the elementary video signal 605 to generate a decoded 
video signal 606. The video enabled device 601 may be 
implemented as a cable set top box, a satellite receiver box, a 
hand held device such as a PDA and/or cellular phone, a 
computer, and/or a digital antenna tuner, for example. 35 

In one aspect of the invention, the video enabled device 601 
may comprise a decoder 602. The decoder 602 may be 
adapted to decode the elementary video stream 605 utilizing 
a motion compensator 603. The motion compensator 603 
may be adapted to acquire temporally encoded prediction 40 
information, such as motion vector information, weighting 
factors and/or offset information that may be utilized by a 
central processing unit (CPU) 608 to predict pixels within a 
current macroblock. The CPU 608 may predict pixels within 
a current macroblock utilizing the acquired prediction infor- 45 
mation and pixels within a previous and/or a subsequent 
reference picture. 

After the elementary video stream 605 is decoded, the 
decoded video signal 606 may be communicated to a display 
604 for further processing. The display 604 may be imple- 50 
mented within a television or other video screen, for example, 
and may be adapted to display the decoded video signal 606. 

Accordingly, aspects of the invention may be realized in 
hardware, software, firmware or a combination thereof. The 
invention may be realized in a centralized fashion in at least 55 
one computer system, or in a distributed fashion where dif- 
ferent elements are spread across several interconnected com- 
puter systems. Any kind of computer system or other appa- 
ratus adapted for carrying out the methods described herein is 
suited. A typical combination of hardware, software and firm- 60 
ware may be a general-purpose computer system with a com- 
puter program that, when being loaded and executed, controls 
the computer system such that it carries out the methods 
described herein. 

Various functional aspects of the present invention may be 65 
performed by a processor executing software instructions, by 
hardware, or by a combination thereof. The explicit and/or 
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implicit prediction weighting tables may, for example, be 
implemented in various memory types, including but not 
limited to, on-chip and/or off-chip memory, local and/or non- 
local memory, SRAM and/or DRAM, volatile and/or non- 
volatile memory, etc. Further, various table implementations 
may be mirrored between various memory modules. For 
example, various table portions may be maintained in rela- 
tively low-speed memory and may be loaded into relatively 
high-speed memory for utilization. Additionally, for 
example, a first type of prediction weighting table may be 
implemented in a first memory module for a first type of 
elementary video stream, and a second type of prediction 
weighting table may be implemented in the first memory 
module for a second type of elementary video stream, such as 
in overlapping memory spaces or in separate memory spaces. 
Accordingly, the scope of various aspects of the present 
invention should not be limited by characteristics of particu- 
lar software and/or hardware implementations. 

One embodiment of the present invention may be imple- 
mented as a board level product, as a single chip, application 
specific integrated circuit (ASIC), or with varying levels inte- 
grated on a single chip with other portions of the system as 
separate components. The degree of integration of the system 
will primarily be determined by speed and cost consider- 
ations. Because of the sophisticated nature of modern proces- 
sors, it is possible to utilize a commercially available proces- 
sor, which may be implemented external to an ASIC 
implementation of the present system. Alternatively, if the 
processor is available as an ASIC core or logic block, then the 
commercially available processor may be implemented as 
part of an ASIC device with various functions implemented as 
firmware. 

The invention may also be embedded in a computer pro- 
gram product, which comprises all the features enabling the 
implementation of the methods described herein, and which 
when loaded in a computer system is able to carry out these 
methods. Computer program in the present context may 
mean, for example, any expression, in any language, code or 
notation, of a set of instructions intended to cause a system 
having an information processing capability to perform a 
particular function either directly or after either or both of the 
following: a) conversion to another language, code or nota- 
tion; b) reproduction in a different material form. However, 
other meanings of computer program within the understand- 
ing of those skilled in the art are also contemplated by the 
present invention. 

While the invention has been described with reference to 
certain embodiments, it will be understood by those skilled in 
the art that various changes may be made and equivalents may 
be substituted without departing from the scope of the present 
invention. In addition, many modifications may be made to 
adapt a particular situation or material to the teachings of the 
present invention without departing from its scope. There- 
fore, it is intended that the present invention not be limited to 
the particular embodiments disclosed, but that the present 
invention will include all embodiments falling within the 
scope of the appended claims. 

The invention claimed is: 

1 . A method for processing pixels within an encoded video 
stream, the method comprising: 

acquiring on a chip, a prediction weight type from the 
encoded video stream; 

acquiring on said chip, prediction weight data for at least 
one prediction pixel within the encoded video stream 
based on said acquired prediction weight type; 
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generating on said chip, at least one look-up table entry in 
a look-up table for said at least one prediction pixel 
utilizing said acquired prediction weight data; and 

estimating on said chip, at least a portion of a plurality of 
current pixels within the encoded video stream utilizing 5 
said generated at least one look-up table entry. 

2. The method according to claim 1, wherein said predic- 
tion weight type corresponds to a plurality of reference pic- 
tures within the encoded video stream. 

3. The method according to claim 1, comprising acquiring 10 
on said chip, at least one prediction weight value, offset value 
and motion vector data for said at least one prediction pixel, if 
said prediction weight type comprises an explicit prediction 
weight type. 

4. The method according to claim 3, comprising generating 1 5 
on said chip, said at least one look-up table entry utilizing said 

at least one prediction weight value and offset value for said 
at least one prediction pixel. 

5. The method according to claim 4, wherein said gener- 
ated at least one look-up table entry corresponds to one or 20 
both of luminance and/or chrominance components of said at 
least one prediction pixel. 

6. The method according to claim 4, comprising indexing 
said at least one look-up table entry utilizing said motion 
vector data. 25 

7. The method according to claim 6, comprising estimating 
on said chip, said at least a portion of said plurality of current 
pixels utilizing said indexed at least one look-up table entry. 

8. The method according to claim 3, comprising stripping 
off said acquired motion vector data, prediction weight value 30 
and offset value from the encoded video stream. 

9. The method according to claim 1, comprising acquiring 
on said chip, motion vector data for said at least one predic- 
tion pixel, if said prediction weight type comprises an implicit 
prediction weight type. 

10. The method according to claim 9, comprising generat- 
ing on said chip, at least one prediction weight value for said 
at least one prediction pixel. 

11. The method according to claim 10, wherein said gen- 4Q 
erated at least one prediction weight value is based on at least 
one temporal distance between portions of a plurality of pix- 
els within the encoded video stream. 

12. The method according to claim 10, comprising gener- 
ating on said chip, said at least one look-up table entry utiliz- 45 
ing said generated at least one prediction weight value. 

13. The method according to claim 12, comprising estimat- 
ing on said chip, said at least a portion of said plurality of 
current pixels utilizing said generated at least one look-up 
table entry. 50 

14. A computer-readable medium encoded with computer 
executable instructions for processing pixels within an 
encoded video stream to perform the steps comprising: 

acquiring on a chip, a prediction weight type from the 
encoded video stream; 55 

acquiring on said chip, prediction weight data for at least 
one prediction pixel within the encoded video stream 
based on said acquired prediction weight type; 

generating on said chip, at least one look-up table entry in 
a look-up table for said at least one prediction pixel 60 
utilizing said acquired prediction weight data; and 

estimating on said chip, at least a portion of a plurality of 
current pixels within the encoded video stream utilizing 
said generated at least one look-up table entry. 

15 . The computer-readable medium according to claim 14, 65 
wherein said prediction weight type corresponds to a plurality 
of reference pictures within the encoded video stream. 
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16. The computer-readable medium according to claim 14, 
comprising code for acquiring on said chip, at least one pre- 
diction weight value, offset value and motion vector data for 
said at least one prediction pixel, if said prediction weight 
type comprises an explicit prediction weight type. 

17. The computer-readable medium according to claim 16, 
comprising code for generating on said chip, said at least one 
look-up table entry utilizing said at least one prediction 
weight value and offset value for said at least one prediction 
pixel. 

18. The computer-readable medium according to claim 17, 
wherein said generated at least one look-up table entry cor- 
responds to one or both of luminance and/or chrominance 
components of said at least one prediction pixel. 

19. The computer-readable medium according to claim 17, 
comprising code for indexing said at least one look-up table 
entry utilizing said motion vector data. 

20. The computer-readable medium according to claim 19, 
comprising code for estimating on said chip, said at least a 
portion of said plurality of current pixels utilizing said 
indexed at least one look-up table entry. 

21. The computer-readable medium according to claim 16, 
comprising code for stripping off said acquired motion vector 
data, prediction weight value and offset value from the 
encoded video stream. 

22. The computer-readable medium according to claim 14, 
comprising code for acquiring on said chip, motion vector 
data for said at least one prediction pixel, if said prediction 
weight type comprises an implicit prediction weight type. 

23. The computer-readable medium according to claim 22, 
comprising code for generating on said chip, at least one 
prediction weight value for said at least one prediction pixel. 

24. The computer-readable medium according to claim 23, 
wherein said generated at least one prediction weight value is 
based on at least one temporal distance between portions of a 
plurality of pixels within the encoded video stream. 

25. The computer-readable medium according to claim 23, 
comprising code for generating on said chip, said at least one 
look-up table entry utilizing said generated at least one pre- 
diction weight value. 

26. The computer-readable medium according to claim 25 
comprising code for estimating on said chip, said at least a 
portion of said plurality of current pixels utilizing said gen- 
erated at least one look-up table entry. 

27. A system for processing pixels within an encoded video 
stream, the system comprising: 

at least one processor implemented on a chip that acquires 
on said chip, a prediction weight type from the encoded 
video stream; 

said at least one processor acquires on said chip, prediction 
weight data for at least one prediction pixel within the 
encoded video stream based on said acquired prediction 
weight type; 

said at least one processor generates on said chip, at least 
one look-up table entry in a look-up table for said at least 
one prediction pixel utilizing said acquired prediction 
weight data; and 

said at least one processor estimates on said chip, at least a 
portion of a plurality of current pixels within the 
encoded video stream utilizing said generated at least 
one look-up table entry. 

28. The system according to claim 27, wherein said pre- 
diction weight type corresponds to a plurality of reference 
pictures within the encoded video stream. 

29. The system according to claim 27, wherein said at least 
one processor acquires on said chip, at least one prediction 
weight value, offset value and motion vector data for said at 
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least one prediction pixel, if said prediction weight type com- 
prises an explicit prediction weight type. 

30. The system according to claim 29, wherein said at least 
one processor generates on said chip, said at least one look-up 
table entry utilizing said at least one prediction weight value 5 
and offset value for said at least one prediction pixel. 

31. The system according to claim 30, wherein said gen- 
erated at least one look-up table entry corresponds to one or 
both of luminance and/or chrominance components of said at 
least one prediction pixel. 10 

32. The system according to claim 30, wherein said at least 
one processor indexes said at least one look-up table entry 
utilizing said motion vector data. 

33. The system according to claim 32, wherein said at least 
one processor estimates on said chip, said at least a portion of 15 
said plurality of current pixels utilizing said indexed at least 
one look-up table entry. 

34. The system according to claim 29, wherein said at least 
one processor strips off said acquired motion vector data, 
prediction weight value and offset value from the encoded 20 
video stream. 



,637 B2 

14 

35. The system according to claim 27, wherein said at least 
one processor acquires on said chip, motion vector data for 
said at least one prediction pixel, if said prediction weight 
type comprises an implicit prediction weight type. 

36. The system according to claim 35, wherein said at least 
one processor generates on said chip, at least one prediction 
weight value for said at least one prediction pixel. 

37. The system according to claim 36, wherein said gen- 
erated at least one prediction weight value is based on at least 
one temporal distance between portions of a plurality of pix- 
els within the encoded video stream. 

38. The system according to claim 36, wherein said at least 
one processor generates on said chip, said at least one look-up 
table entry utilizing said generated at least one prediction 
weight value. 

39. The system according to claim 38, wherein said at least 
one processor estimates on said chip, said at least a portion of 
said plurality of current pixels utilizing said generated at least 
one look-up table entry. 
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